Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 24
Filtrar
1.
Cancers (Basel) ; 16(7)2024 Mar 29.
Artigo em Inglês | MEDLINE | ID: mdl-38611028

RESUMO

Topic modeling is a popular technique in machine learning and natural language processing, where a corpus of text documents is classified into themes or topics using word frequency analysis. This approach has proven successful in various biological data analysis applications, such as predicting cancer subtypes with high accuracy and identifying genes, enhancers, and stable cell types simultaneously from sparse single-cell epigenomics data. The advantage of using a topic model is that it not only serves as a clustering algorithm, but it can also explain clustering results by providing word probability distributions over topics. Our study proposes a novel topic modeling approach for clustering single cells and detecting topics (gene signatures) in single-cell datasets that measure multiple omics simultaneously. We applied this approach to examine the transcriptional heterogeneity of luminal and triple-negative breast cancer cells using patient-derived xenograft models with acquired resistance to chemotherapy and targeted therapy. Through this approach, we identified protein-coding genes and long non-coding RNAs (lncRNAs) that group thousands of cells into biologically similar clusters, accurately distinguishing drug-sensitive and -resistant breast cancer types. In comparison to standard state-of-the-art clustering analyses, our approach offers an optimal partitioning of genes into topics and cells into clusters simultaneously, producing easily interpretable clustering outcomes. Additionally, we demonstrate that an integrative clustering approach, which combines the information from mRNAs and lncRNAs treated as disjoint omics layers, enhances the accuracy of cell classification.

2.
PLoS One ; 19(3): e0289699, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38512819

RESUMO

MicroRNAs (miRNAs) are small molecules that play an essential role in regulating gene expression by post-transcriptional gene silencing. Their study is crucial in revealing the fundamental processes underlying pathologies and, in particular, cancer. To date, most studies on miRNA regulation consider the effect of specific miRNAs on specific target mRNAs, providing wet-lab validation. However, few tools have been developed to explain the miRNA-mediated regulation at the protein level. In this paper, the MoPC computational tool is presented, that relies on the partial correlation between mRNAs and proteins conditioned on the miRNA expression to predict miRNA-target interactions in multi-omic datasets. MoPC returns the list of significant miRNA-target interactions and plot the significant correlations on the heatmap in which the miRNAs and targets are ordered by the chromosomal location. The software was applied on three TCGA/CPTAC datasets (breast, glioblastoma, and lung cancer), returning enriched results in three independent targets databases.


Assuntos
MicroRNAs , Neoplasias , Humanos , MicroRNAs/genética , MicroRNAs/metabolismo , Proteoma/genética , Proteoma/metabolismo , Neoplasias/genética , Software , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Biologia Computacional/métodos , Perfilação da Expressão Gênica , Regulação Neoplásica da Expressão Gênica
3.
NPJ Syst Biol Appl ; 10(1): 8, 2024 Jan 19.
Artigo em Inglês | MEDLINE | ID: mdl-38242871

RESUMO

The efficiency of analyzing high-throughput data in systems biology has been demonstrated in numerous studies, where molecular data, such as transcriptomics and proteomics, offers great opportunities for understanding the complexity of biological processes. One important aspect of data analysis in systems biology is the shift from a reductionist approach that focuses on individual components to a more integrative perspective that considers the system as a whole, where the emphasis shifted from differential expression of individual genes to determining the activity of gene sets. Here, we present the rROMA software package for fast and accurate computation of the activity of gene sets with coordinated expression. The rROMA package incorporates significant improvements in the calculation algorithm, along with the implementation of several functions for statistical analysis and visualizing results. These additions greatly expand the package's capabilities and offer valuable tools for data analysis and interpretation. It is an open-source package available on github at: www.github.com/sysbio-curie/rROMA . Based on publicly available transcriptomic datasets, we applied rROMA to cystic fibrosis, highlighting biological mechanisms potentially involved in the establishment and progression of the disease and the associated genes. Results indicate that rROMA can detect disease-related active signaling pathways using transcriptomic and proteomic data. The results notably identified a significant mechanism relevant to cystic fibrosis, raised awareness of a possible bias related to cell culture, and uncovered an intriguing gene that warrants further investigation.


Assuntos
Fibrose Cística , Proteômica , Humanos , Proteômica/métodos , Perfilação da Expressão Gênica/métodos , Transcriptoma/genética , Biologia de Sistemas/métodos
4.
BMC Bioinformatics ; 24(1): 443, 2023 Nov 22.
Artigo em Inglês | MEDLINE | ID: mdl-37993778

RESUMO

Messenger RNA (mRNA) has an essential role in the protein production process. Predicting mRNA expression levels accurately is crucial for understanding gene regulation, and various models (statistical and neural network-based) have been developed for this purpose. A few models predict mRNA expression levels from the DNA sequence, exploiting the DNA sequence and gene features (e.g., number of exons/introns, gene length). Other models include information about long-range interaction molecules (i.e., enhancers/silencers) and transcriptional regulators as predictive features, such as transcription factors (TFs) and small RNAs (e.g., microRNAs - miRNAs). Recently, a convolutional neural network (CNN) model, called Xpresso, has been proposed for mRNA expression level prediction leveraging the promoter sequence and mRNAs' half-life features (gene features). To push forward the mRNA level prediction, we present miREx, a CNN-based tool that includes information about miRNA targets and expression levels in the model. Indeed, each miRNA can target specific genes, and the model exploits this information to guide the learning process. In detail, not all miRNAs are included, only a selected subset with the highest impact on the model. MiREx has been evaluated on four cancer primary sites from the genomics data commons (GDC) database: lung, kidney, breast, and corpus uteri. Results show that mRNA level prediction benefits from selected miRNA targets and expression information. Future model developments could include other transcriptional regulators or be trained with proteomics data to infer protein levels.


Assuntos
MicroRNAs , MicroRNAs/genética , RNA Mensageiro/genética , Mirex , Regulação da Expressão Gênica , Fatores de Transcrição/genética , Perfilação da Expressão Gênica
5.
Cell Rep Med ; 4(2): 100919, 2023 02 21.
Artigo em Inglês | MEDLINE | ID: mdl-36706754

RESUMO

X-linked chronic granulomatous disease (CGD) is associated with defective phagocytosis, life-threatening infections, and inflammatory complications. We performed a clinical trial of lentivirus-based gene therapy in four patients (NCT02757911). Two patients show stable engraftment and clinical benefits, whereas the other two have progressively lost gene-corrected cells. Single-cell transcriptomic analysis reveals a significantly lower frequency of hematopoietic stem cells (HSCs) in CGD patients, especially in the two patients with defective engraftment. These two present a profound change in HSC status, a high interferon score, and elevated myeloid progenitor frequency. We use elastic-net logistic regression to identify a set of 51 interferon genes and transcription factors that predict the failure of HSC engraftment. In one patient, an aberrant HSC state with elevated CEBPß expression drives HSC exhaustion, as demonstrated by low repopulation in a xenotransplantation model. Targeted treatments to protect HSCs, coupled to targeted gene expression screening, might improve clinical outcomes in CGD.


Assuntos
Doença Granulomatosa Crônica , Transplante de Células-Tronco Hematopoéticas , Humanos , Terapia Genética/efeitos adversos , Doença Granulomatosa Crônica/diagnóstico , Doença Granulomatosa Crônica/genética , Doença Granulomatosa Crônica/terapia , Células-Tronco Hematopoéticas/metabolismo , Inflamação/metabolismo , Interferons/metabolismo
6.
Adv Exp Med Biol ; 1385: 259-279, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-36352218

RESUMO

In recent cancer genomics programs, large-scale profiling of microRNAs has been routinely used in order to better understand the role of microRNAs in gene regulation and disease. To support the analysis of such amount of data, scalability of bioinformatics pipelines is increasingly important to handle larger datasets.Here, we describe a scalable implementation of the clustered miRNA Master Regulator Analysis (clustMMRA) pipeline, developed to search for genomic clusters of microRNAs potentially driving cancer molecular subtyping. Genomically clustered microRNAs can be simultaneously expressed to work in a combined manner and jointly regulate cell phenotypes. However, the majority of computational approaches for the identification of microRNA master regulators are typically designed to detect the regulatory effect of a single microRNA.We have applied the clustMMRA pipeline to multiple pediatric tumor datasets, up to a hundred samples in size, demonstrating very satisfying performances of the software on large datasets. Results have highlighted genomic clusters of microRNAs potentially involved in several subgroups of the different pediatric cancers or specifically involved in the phenotype of a subgroup. In particular, we confirmed the cluster of microRNAs at the 14q32 locus to be involved in multiple pediatric cancers, showing its specific downregulation in tumor subgroups with aggressive phenotype.


Assuntos
MicroRNAs , Neoplasias , Humanos , MicroRNAs/genética , Perfilação da Expressão Gênica/métodos , Neoplasias/genética , Análise por Conglomerados , Regulação da Expressão Gênica , Biologia Computacional , Regulação Neoplásica da Expressão Gênica
7.
Cancers (Basel) ; 13(16)2021 Aug 05.
Artigo em Inglês | MEDLINE | ID: mdl-34439123

RESUMO

The identification of miRNAs' targets and associated regulatory networks might allow the definition of new strategies using drugs whose association mimics a given miRNA's effects. Based on this assumption we devised a multi-omics approach to precisely characterize miRNAs' effects. We combined miR-491-5p target affinity purification, RNA microarray, and mass spectrometry to perform an integrated analysis in ovarian cancer cell lines. We thus constructed an interaction network that highlighted highly connected hubs being either direct or indirect targets of miR-491-5p effects: the already known EGFR and BCL2L1 but also EP300, CTNNB1 and several small-GTPases. By using different combinations of specific inhibitors of these hubs, we could greatly enhance their respective cytotoxicity and mimic the miR-491-5p-induced phenotype. Our methodology thus constitutes an interesting strategy to comprehensively study the effects of a given miRNA. Moreover, we identified targets for which pharmacological inhibitors are already available for a clinical use or in clinical trials. This study might thus enable innovative therapeutic options for ovarian cancer, which remains the leading cause of death from gynecological malignancies in developed countries.

8.
Nat Biotechnol ; 39(9): 1095-1102, 2021 09.
Artigo em Inglês | MEDLINE | ID: mdl-33927417

RESUMO

Because of the stochasticity associated with high-throughput single-cell sequencing, current methods for exploring cell-type diversity rely on clustering-based computational approaches in which heterogeneity is characterized at cell subpopulation rather than at full single-cell resolution. Here we present Cell-ID, a clustering-free multivariate statistical method for the robust extraction of per-cell gene signatures from single-cell sequencing data. We applied Cell-ID to data from multiple human and mouse samples, including blood cells, pancreatic islets and airway, intestinal and olfactory epithelium, as well as to comprehensive mouse cell atlas datasets. We demonstrate that Cell-ID signatures are reproducible across different donors, tissues of origin, species and single-cell omics technologies, and can be used for automatic cell-type annotation and cell matching across datasets. Cell-ID improves biological interpretation at individual cell level, enabling discovery of previously uncharacterized rare cell types or cell states. Cell-ID is distributed as an open-source R software package.


Assuntos
Biologia Computacional/métodos , Perfilação da Expressão Gênica , Análise de Célula Única/métodos , Animais , Análise por Conglomerados , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Humanos , Camundongos , Análise Multivariada
9.
Genome Biol ; 21(1): 302, 2020 12 14.
Artigo em Inglês | MEDLINE | ID: mdl-33317623

RESUMO

BACKGROUND: Tumor-specific genomic aberrations are routinely determined by high-throughput genomic measurements. It remains unclear how complex genome alterations affect molecular networks through changing protein levels and consequently biochemical states of tumor tissues. RESULTS: Here, we investigate the propagation of genomic effects along the axis of gene expression during prostate cancer progression. We quantify genomic, transcriptomic, and proteomic alterations based on 105 prostate samples, consisting of benign prostatic hyperplasia regions and malignant tumors, from 39 prostate cancer patients. Our analysis reveals the convergent effects of distinct copy number alterations impacting on common downstream proteins, which are important for establishing the tumor phenotype. We devise a network-based approach that integrates perturbations across different molecular layers, which identifies a sub-network consisting of nine genes whose joint activity positively correlates with increasingly aggressive tumor phenotypes and is predictive of recurrence-free survival. Further, our data reveal a wide spectrum of intra-patient network effects, ranging from similar to very distinct alterations on different molecular layers. CONCLUSIONS: This study uncovers molecular networks with considerable convergent alterations across tumor sites and patients. It also exposes a diversity of network effects: we could not identify a single sub-network that is perturbed in all high-grade tumor regions.


Assuntos
Progressão da Doença , Regulação Neoplásica da Expressão Gênica , Neoplasias da Próstata/genética , Biomarcadores Tumorais/genética , Variações do Número de Cópias de DNA , Heterogeneidade Genética , Genômica , Humanos , Masculino , Mutação , Fenótipo , Próstata/patologia , Proteogenômica , Proteoma , Proteômica , RNA Mensageiro , Transcriptoma
10.
Int J Mol Sci ; 21(18)2020 Sep 10.
Artigo em Inglês | MEDLINE | ID: mdl-32927759

RESUMO

Background: The prevalence of chronic kidney disease is increased in patients with cystic fibrosis (CF). The study of urinary exosomal proteins might provide insight into the pathophysiology of CF kidney disease. Methods: Urine samples were collected from 19 CF patients (among those 7 were treated by cystic fibrosis transmembrane conductance regulator (CFTR) modulators), and 8 healthy subjects. Urine exosomal protein content was determined by high resolution mass spectrometry. Results: A heatmap of the differentially expressed proteins in urinary exosomes showed a clear separation between control and CF patients. Seventeen proteins were upregulated in CF patients (including epidermal growth factor receptor (EGFR); proteasome subunit beta type-6, transglutaminases, caspase 14) and 118 were downregulated (including glutathione S-transferases, superoxide dismutase, klotho, endosomal sorting complex required for transport, and matrisome proteins). Gene set enrichment analysis revealed 20 gene sets upregulated and 74 downregulated. Treatment with CFTR modulators yielded no significant modification of the proteomic content. These results highlight that CF kidney cells adapt to the CFTR defect by upregulating proteasome activity and that autophagy and endosomal targeting are impaired. Increased expression of EGFR and decreased expression of klotho and matrisome might play a central role in this CF kidney signature by inducing oxidation, inflammation, accelerated senescence, and abnormal tissue repair. Conclusions: Our study unravels novel insights into consequences of CFTR dysfunction in the urinary tract, some of which may have clinical and therapeutic implications.


Assuntos
Fibrose Cística/urina , Exossomos/metabolismo , Nefropatias/urina , Adolescente , Adulto , Aminofenóis/uso terapêutico , Aminopiridinas/uso terapêutico , Benzodioxóis/uso terapêutico , Estudos de Casos e Controles , Criança , Pré-Escolar , Fibrose Cística/complicações , Fibrose Cística/tratamento farmacológico , Combinação de Medicamentos , Humanos , Indóis/uso terapêutico , Nefropatias/etiologia , Proteoma , Quinolonas/uso terapêutico , Adulto Jovem
11.
iScience ; 21: 664-680, 2019 Nov 22.
Artigo em Inglês | MEDLINE | ID: mdl-31733513

RESUMO

Here we describe a proteomic data resource for the NCI-60 cell lines generated by pressure cycling technology and SWATH mass spectrometry. We developed the DIA-expert software to curate and visualize the SWATH data, leading to reproducible detection of over 3,100 SwissProt proteotypic proteins and systematic quantification of pathway activities. Stoichiometric relationships of interacting proteins for DNA replication, repair, the chromatin remodeling NuRD complex, ß-catenin, RNA metabolism, and prefoldins are more evident than that at the mRNA level. The data are available in CellMiner (discover.nci.nih.gov/cellminercdb and discover.nci.nih.gov/cellminer), allowing casual users to test hypotheses and perform integrative, cross-database analyses of multi-omic drug response correlations for over 20,000 drugs. We demonstrate the value of proteome data in predicting drug response for over 240 clinically relevant chemotherapeutic and targeted therapies. In summary, we present a novel proteome resource for the NCI-60, together with relevant software tools, and demonstrate the benefit of proteome analyses.

12.
Epigenomics ; 11(14): 1581-1599, 2019 11.
Artigo em Inglês | MEDLINE | ID: mdl-31693439

RESUMO

Aim: Growing evidence shows a strong interplay between post-transcriptional regulation, mediated by miRNAs (miRs) and epigenetic regulation. Nevertheless, the number of experimentally validated miRs (called epi-miRs) involved in these regulatory circuitries is still very small. Material & methods: We propose a pipeline to prioritize candidate epi-miRs and to identify potential epigenetic interactors of any given miR starting from miR transfection experiment datasets. Results & conclusion: We identified 34 candidate epi-miRs: 19 of them are known epi-miRs, while 15 are new. Moreover, using an in-house generated gene expression dataset, we experimentally proved that a component of the polycomb-repressive complex 2, the histone methyltransferase enhancer of zeste homolog 2 (EZH2), interacts with miR-214, a well-known prometastatic miR in melanoma and breast cancer, highlighting a miR-214-EZH2 regulatory axis potentially relevant in tumor progression.


Assuntos
Epigênese Genética/genética , MicroRNAs/genética , Neoplasias da Mama/genética , Linhagem Celular Tumoral , Proteína Potenciadora do Homólogo 2 de Zeste/genética , Feminino , Regulação Neoplásica da Expressão Gênica/genética , Humanos , Melanoma/genética , Complexo Repressor Polycomb 2/genética , Transfecção/métodos
13.
Nucleic Acids Res ; 47(5): 2205-2215, 2019 03 18.
Artigo em Inglês | MEDLINE | ID: mdl-30657980

RESUMO

MicroRNAs play important roles in many biological processes. Their aberrant expression can have oncogenic or tumor suppressor function directly participating to carcinogenesis, malignant transformation, invasiveness and metastasis. Indeed, miRNA profiles can distinguish not only between normal and cancerous tissue but they can also successfully classify different subtypes of a particular cancer. Here, we focus on a particular class of transcripts encoding polycistronic miRNA genes that yields multiple miRNA components. We describe 'clustered MiRNA Master Regulator Analysis (ClustMMRA)', a fully redesigned release of the MMRA computational pipeline (MiRNA Master Regulator Analysis), developed to search for clustered miRNAs potentially driving cancer molecular subtyping. Genomically clustered miRNAs are frequently co-expressed to target different components of pro-tumorigenic signaling pathways. By applying ClustMMRA to breast cancer patient data, we identified key miRNA clusters driving the phenotype of different tumor subgroups. The pipeline was applied to two independent breast cancer datasets, providing statistically concordant results between the two analyses. We validated in cell lines the miR-199/miR-214 as a novel cluster of miRNAs promoting the triple negative breast cancer (TNBC) phenotype through its control of proliferation and EMT.


Assuntos
Transição Epitelial-Mesenquimal/genética , MicroRNAs/genética , Família Multigênica/genética , Neoplasias de Mama Triplo Negativas/genética , Neoplasias de Mama Triplo Negativas/patologia , Linhagem Celular Tumoral , Proliferação de Células , Conjuntos de Dados como Assunto , Inativação Gênica , Humanos , Invasividade Neoplásica/genética , Reprodutibilidade dos Testes , Neoplasias de Mama Triplo Negativas/classificação
14.
Brief Bioinform ; 20(4): 1238-1249, 2019 07 19.
Artigo em Inglês | MEDLINE | ID: mdl-29237040

RESUMO

Mathematical models can serve as a tool to formalize biological knowledge from diverse sources, to investigate biological questions in a formal way, to test experimental hypotheses, to predict the effect of perturbations and to identify underlying mechanisms. We present a pipeline of computational tools that performs a series of analyses to explore a logical model's properties. A logical model of initiation of the metastatic process in cancer is used as a transversal example. We start by analysing the structure of the interaction network constructed from the literature or existing databases. Next, we show how to translate this network into a mathematical object, specifically a logical model, and how robustness analyses can be applied to it. We explore the visualization of the stable states, defined as specific attractors of the model, and match them to cellular fates or biological read-outs. With the different tools we present here, we explain how to assign to each solution of the model a probability and how to identify genetic interactions using mutant phenotype probabilities. Finally, we connect the model to relevant experimental data: we present how some data analyses can direct the construction of the network, and how the solutions of a mathematical model can also be compared with experimental data, with a particular focus on high-throughput data in cancer biology. A step-by-step tutorial is provided as a Supplementary Material and all models, tools and scripts are provided on an accompanying website: https://github.com/sysbio-curie/Logical_modelling_pipeline.


Assuntos
Modelos Biológicos , Transdução de Sinais , Biologia Computacional/métodos , Simulação por Computador , Bases de Dados Factuais , Doença , Epistasia Genética , Redes Reguladoras de Genes , Humanos , Modelos Logísticos , Conceitos Matemáticos , Redes e Vias Metabólicas , Mutação , Metástase Neoplásica/genética , Metástase Neoplásica/patologia , Metástase Neoplásica/fisiopatologia , Software , Biologia de Sistemas/estatística & dados numéricos
15.
Cancer Cell ; 34(3): 379-395.e7, 2018 09 10.
Artigo em Inglês | MEDLINE | ID: mdl-30205043

RESUMO

The current consensus recognizes four main medulloblastoma subgroups (wingless, Sonic hedgehog, group 3 and group 4). While medulloblastoma subgroups have been characterized extensively at the (epi-)genomic and transcriptomic levels, the proteome and phosphoproteome landscape remain to be comprehensively elucidated. Using quantitative (phospho)-proteomics in primary human medulloblastomas, we unravel distinct posttranscriptional regulation leading to highly divergent oncogenic signaling and kinase activity profiles in groups 3 and 4 medulloblastomas. Specifically, proteomic and phosphoproteomic analyses identify aberrant ERBB4-SRC signaling in group 4. Hence, enforced expression of an activated SRC combined with p53 inactivation induces murine tumors that resemble group 4 medulloblastoma. Therefore, our integrative proteogenomics approach unveils an oncogenic pathway and potential therapeutic vulnerability in the most common medulloblastoma subgroup.


Assuntos
Neoplasias Cerebelares/patologia , Meduloblastoma/patologia , Receptor ErbB-4/metabolismo , Quinases da Família src/metabolismo , Adolescente , Animais , Carcinogênese/patologia , Linhagem Celular Tumoral , Neoplasias Cerebelares/genética , Cerebelo/patologia , Criança , Pré-Escolar , Feminino , Perfilação da Expressão Gênica , Regulação Neoplásica da Expressão Gênica , Humanos , Lactente , Masculino , Meduloblastoma/genética , Camundongos , Camundongos Transgênicos , Fosforilação , Proteoma/metabolismo , Proteômica/métodos , Transdução de Sinais , Quinases da Família src/genética
16.
NPJ Syst Biol Appl ; 4: 2, 2018.
Artigo em Inglês | MEDLINE | ID: mdl-29263798

RESUMO

Gene signatures are more and more used to interpret results of omics data analyses but suffer from compositional (large overlap) and functional (correlated read-outs) redundancy. Moreover, many gene signatures rarely come out as significant in statistical tests. Based on pan-cancer data analysis, we construct a restricted set of 962 signatures defined as informative and demonstrate that they have a higher probability to appear enriched in comparative cancer studies. We show that the majority of informative signatures conserve their weights for the genes composing the signature (eigengenes) from one cancer type to another. We finally construct InfoSigMap, an interactive online map of these signatures and their cross-correlations. This map highlights the structure of compositional and functional redundancies between informative signatures, and it charts the territories of biological functions. InfoSigMap can be used to visualize the results of omics data analyses and suggests a rearrangement of existing gene sets.

17.
Front Biosci (Landmark Ed) ; 22(10): 1774-1791, 2017 06 01.
Artigo em Inglês | MEDLINE | ID: mdl-28410145

RESUMO

MicroRNAs (miRNAs) are small non-coding RNAs playing an essential role in gene expression regulation. Multiple studies have demonstrated that miRNAs are dysregulated in cancer initiation and progression, pointing out their potential as biomarkers for diagnosis, prognosis and response to treatment. With the introduction of high-throughput technologies several computational approaches have been proposed to identify cancer-associated miRNAs. Here, we present a systematic and comprehensive overview of the current knowledge concerning the computational detection of miRNAs involved in tumor onset and subtyping, with possible theranostic employment. An overview of the state of art in this field is thus proposed with the aim of supporting researchers, especially experimentalists and pathologists, in choosing the optimal approach for their case of study.


Assuntos
Biologia Computacional/métodos , Regulação Neoplásica da Expressão Gênica , MicroRNAs/genética , Neoplasias/genética , Biomarcadores Tumorais/genética , Progressão da Doença , Humanos , Neoplasias/diagnóstico , Prognóstico , RNA Mensageiro/genética , Fatores de Transcrição/genética
18.
Front Genet ; 7: 18, 2016.
Artigo em Inglês | MEDLINE | ID: mdl-26925094

RESUMO

In many analyses of high-throughput data in systems biology, there is a need to quantify the activity of a set of genes in individual samples. A typical example is the case where it is necessary to estimate the activity of a transcription factor (which is often not directly measurable) from the expression of its target genes. We present here ROMA (Representation and quantification Of Module Activities) Java software, designed for fast and robust computation of the activity of gene sets (or modules) with coordinated expression. ROMA activity quantification is based on the simplest uni-factor linear model of gene regulation that approximates the expression data of a gene set by its first principal component. The proposed algorithm implements novel functionalities: it provides several method modifications for principal components computation, including weighted, robust and centered methods; it distinguishes overdispersed modules (based on the variance explained by the first principal component) and coordinated modules (based on the significance of the spectral gap); finally, it computes statistical significance of the estimated module overdispersion or coordination. ROMA can be applied in many contexts, from estimating differential activities of transcriptional factors to finding overdispersed pathways in single-cell transcriptomics data. We describe here the principles of ROMA providing several practical examples of its use. ROMA source code is available at https://github.com/sysbio-curie/Roma.

19.
PLoS Comput Biol ; 11(11): e1004571, 2015 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-26528548

RESUMO

Understanding the etiology of metastasis is very important in clinical perspective, since it is estimated that metastasis accounts for 90% of cancer patient mortality. Metastasis results from a sequence of multiple steps including invasion and migration. The early stages of metastasis are tightly controlled in normal cells and can be drastically affected by malignant mutations; therefore, they might constitute the principal determinants of the overall metastatic rate even if the later stages take long to occur. To elucidate the role of individual mutations or their combinations affecting the metastatic development, a logical model has been constructed that recapitulates published experimental results of known gene perturbations on local invasion and migration processes, and predict the effect of not yet experimentally assessed mutations. The model has been validated using experimental data on transcriptome dynamics following TGF-ß-dependent induction of Epithelial to Mesenchymal Transition in lung cancer cell lines. A method to associate gene expression profiles with different stable state solutions of the logical model has been developed for that purpose. In addition, we have systematically predicted alleviating (masking) and synergistic pairwise genetic interactions between the genes composing the model with respect to the probability of acquiring the metastatic phenotype. We focused on several unexpected synergistic genetic interactions leading to theoretically very high metastasis probability. Among them, the synergistic combination of Notch overexpression and p53 deletion shows one of the strongest effects, which is in agreement with a recent published experiment in a mouse model of gut cancer. The mathematical model can recapitulate experimental mutations in both cell line and mouse models. Furthermore, the model predicts new gene perturbations that affect the early steps of metastasis underlying potential intervention points for innovative therapeutic strategies in oncology.


Assuntos
Transição Epitelial-Mesenquimal/genética , Modelos Biológicos , Invasividade Neoplásica/genética , Metástase Neoplásica/genética , Animais , Neoplasias do Colo/genética , Neoplasias do Colo/metabolismo , Biologia Computacional , Bases de Dados Genéticas , Humanos , Camundongos , Camundongos Transgênicos , Mutação
20.
BMC Genomics ; 16: S4, 2015.
Artigo em Inglês | MEDLINE | ID: mdl-26046581

RESUMO

Identifying key microRNAs (miRNAs) contributing to the genesis and development of a particular disease is a focus of many recent studies. We introduce here a rank-based algorithm to detect miRNA regulatory activity in cancer-derived tissue samples which combines measurements of gene and miRNA expression levels and sequence-based target predictions. The method is designed to detect modest but coordinated changes in the expression of sequence-based predicted target genes. We applied our algorithm to a cohort of 129 tumour and healthy breast tissues and showed its effectiveness in identifying functional miRNAs possibly involved in the disease. These observations have been validated using an independent publicly available breast cancer dataset from The Cancer Genome Atlas. We focused on the triple negative breast cancer subtype to highlight potentially relevant miRNAs in this tumour subtype. For those miRNAs identified as potential regulators, we characterize the function of affected target genes by enrichment analysis. In the two independent datasets, the affected targets are not necessarily the same, but display similar enriched categories, including breast cancer related processes like cell substrate adherens junction, regulation of cell migration, nuclear pore complex and integrin pathway. The R script implementing our method together with the datasets used in the study can be downloaded here (http://bioinfo-out.curie.fr/projects/targetrunningsum).


Assuntos
MicroRNAs/metabolismo , Transcriptoma , Neoplasias de Mama Triplo Negativas/genética , Interface Usuário-Computador , Regiões 3' não Traduzidas , Algoritmos , Feminino , Humanos , Internet , Neoplasias de Mama Triplo Negativas/patologia
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA